Learning to Resolve Natural Language
نویسنده
چکیده
We analyze a few of the commonly used statistics based and machine learning algorithms for natural language disambiguation tasks and observe that they can be re-cast as learning linear separators in the feature space. Each of the methods makes a priori assumptions, which it employs, given the data, when searching for its hypothesis. Nevertheless, as we show, it searches a space that is as rich as the space of all linear separators. We use this to build an argument for a data driven approach which merely searches for a good linear sepa-rator in the feature space, without further assumptions on the domain or a speciic problem. We present such an approach-a sparse network of linear separators, utilizing the Winnow learning algorithm and show how to use it in a variety of ambiguity resolution problems. The learning approach presented is attribute-eecient and, therefore, appropriate for domains having very large number of attributes. In particular, we present an extensive experimental comparison of our approach with other methods on several well studied lexical disambiguation tasks such as context-sensitive spelling correction, prepositional phrase attachment and part of speech tagging. In all cases we show that our approach either outperforms other methods tried for these tasks or performs comparably to the best.
منابع مشابه
Ensemble Learning for Low Resources Prepositional Phrase Attachment
Prepositional phrase attachment is a major disambiguation problem when it’s about parsing natural language, for many languages. In this paper a low resources policy is proposed using supervised machine learning algorithms in order to resolve the disambiguation problem of prepositional phrase attachment in Modern Greek. It is a first attempt to resolve prepositional phrase attachment in Modern G...
متن کاملInteractively Picking Real-World Objects with Unconstrained Spoken Language Instructions
Comprehension of spoken natural language is an essential skill for robots to communicate with humans effectively. However, handling unconstrained spoken instructions is challenging due to (1) complex structures and the wide variety of expressions used in spoken language, and (2) inherent ambiguity of human instructions. In this paper, we propose the first comprehensive system for controlling ro...
متن کاملDetermination of Features for a Machine Learning Approach to Pronominal Anaphora Resolution in Basque
In this paper we present the preliminaries for a machine learning approach to resolve the pronominal anaphora in Basque language. In this work we determine the appropriate features to be used in this task.
متن کاملLearning to Disambiguate Natural Language Using World Knowledge
We present a general framework and learning algorithm for the task of concept labeling: each word in a given sentence has to be tagged with the unique physical entity (e.g. person, object or location) or abstract concept it refers to. Our method allows both world knowledge and linguistic information to be used during learning and prediction. We show experimentally that we can handle natural lan...
متن کاملWiddowson and Classroom Discourse
Drawing on recent developments in linguistic description and applied linguistics, it can be concluded that learning a language necessitates getting to know something and being able to do something with that knowledge: competence, and performance. Structural approach to language description attaches importance to the former; communicative approach to the latter. Appropriate classroom discours...
متن کاملTo Appear Coling-acl '98: Workshop on Usage of Wordnet in Natural Language Processing Systems Incorporating Knowledge in Natural Language Learning: a Case Study
Incorporating external information during a learning process is expected to improve its eeciency. We study a method for incorporating noun-class information , in the context of learning to resolve Prepo-sitional Phrase Attachment (PPA) disambiguation. This is done within a recently introduced architecture , SNOW, a sparse network of threshold gates utilizing the Winnow learning algorithm. That ...
متن کامل